SGML exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Information Retrieval and Structured Documents

Internal identifier: 001B25 (Main/Exploration); previous: 001B24; next: 001B26

Authors: Yves Chiaramella [France]

Source:

RBID : ISTEX:B8BDC4B2184CBB1F85C4AFADFFEEE2C8601A399A

Abstract

Standard information retrieval treats documents as atomic units of information that are indexed and retrieved as a whole. Document design and storage have long since evolved towards more elaborate representations of documents; standards such as SGML, then HTML and now XML are major contributions in this domain. These standards underlie today's evolution towards modern electronic documents. In this context, retrieving structured documents means indexing and retrieving information according to a given document structure. Documents are then no longer considered atomic entities, but aggregates of interrelated objects that can be retrieved separately: given a retrieval query, one may retrieve the set of document components that are most relevant to this query. In this chapter we shall first emphasise some aspects which, in our opinion, relate the explicit use of document structure to interactive retrieval performance, such as efficiency while browsing or querying information. In a second step we shall investigate two classes of implementation approaches to indexing and retrieving structured documents: passage retrieval and the explicit use of hierarchical document structures.
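
To make the idea of retrieving document components rather than whole documents concrete, here is a minimal Python sketch. It is not the chapter's method: the `Component` class, the `rank_components` function, the sample document and the scoring rule (query-term matches in a component's subtree divided by the subtree length) are all assumptions chosen for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Component:
    """A node of a structured document (hypothetical model, for illustration)."""
    label: str
    text: str = ""
    children: list = field(default_factory=list)


def tokens(text):
    """Lowercase whitespace tokenisation with basic punctuation stripping."""
    return [w.strip(".,;:").lower() for w in text.split()]


def subtree_stats(component, query_terms, results):
    """Return (matches, length) for the subtree and record one score per component."""
    words = tokens(component.text)
    matches = sum(1 for w in words if w in query_terms)
    length = len(words)
    for child in component.children:
        child_matches, child_length = subtree_stats(child, query_terms, results)
        matches += child_matches
        length += child_length
    # Toy score (assumption): share of subtree tokens that are query terms.
    score = matches / length if length else 0.0
    results.append((component.label, score))
    return matches, length


def rank_components(root, query):
    """Rank every component of the document, best score first."""
    results = []
    subtree_stats(root, set(tokens(query)), results)
    return sorted(results, key=lambda item: item[1], reverse=True)


# Hypothetical sample document: a chapter with two sections.
doc = Component("chapter", "", [
    Component("section 1", "SGML HTML and XML are document standards"),
    Component("section 2", "Two approaches are discussed", [
        Component("paragraph 2.1", "Passage retrieval"),
        Component("paragraph 2.2", "Hierarchical structures of documents"),
    ]),
])

ranked = rank_components(doc, "passage retrieval")
for label, score in ranked:
    print(f"{label}: {score:.3f}")
```

With this toy scoring, the paragraph that contains both query terms ranks above its enclosing section and above the chapter as a whole, which is the behaviour the abstract describes: the most relevant components are returned, not the entire document.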

Url:
DOI: 10.1007/3-540-45368-7_12


Affiliations:


The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Information Retrieval and Structured Documents</title>
<author>
<name sortKey="Chiaramella, Yves" sort="Chiaramella, Yves" uniqKey="Chiaramella Y" first="Yves" last="Chiaramella">Yves Chiaramella</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:B8BDC4B2184CBB1F85C4AFADFFEEE2C8601A399A</idno>
<date when="2000" year="2000">2000</date>
<idno type="doi">10.1007/3-540-45368-7_12</idno>
<idno type="url">https://api.istex.fr/ark:/67375/HCB-1GD90M1X-7/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">002F56</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">002F56</idno>
<idno type="wicri:Area/Istex/Curation">002515</idno>
<idno type="wicri:Area/Istex/Checkpoint">001954</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">001954</idno>
<idno type="wicri:doubleKey">0302-9743:2000:Chiaramella Y:information:retrieval:and</idno>
<idno type="wicri:Area/Main/Merge">001B69</idno>
<idno type="wicri:Area/Main/Curation">001B25</idno>
<idno type="wicri:Area/Main/Exploration">001B25</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Information Retrieval and Structured Documents</title>
<author>
<name sortKey="Chiaramella, Yves" sort="Chiaramella, Yves" uniqKey="Chiaramella Y" first="Yves" last="Chiaramella">Yves Chiaramella</name>
<affiliation wicri:level="3">
<country xml:lang="fr">France</country>
<wicri:regionArea>CLIPS Laboratory, Grenoble Cedex 6, BP 53. 38041</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Auvergne-Rhône-Alpes</region>
<region type="old region" nuts="2">Rhône-Alpes</region>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s" type="main" xml:lang="en">Lecture Notes in Computer Science</title>
<idno type="ISSN">0302-9743</idno>
<idno type="ISSN">0302-9743</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: Standard information retrieval treats documents as atomic units of information that are indexed and retrieved as a whole. Document design and storage have long since evolved towards more elaborate representations of documents; standards such as SGML, then HTML and now XML are major contributions in this domain. These standards underlie today's evolution towards modern electronic documents. In this context, retrieving structured documents means indexing and retrieving information according to a given document structure. Documents are then no longer considered atomic entities, but aggregates of interrelated objects that can be retrieved separately: given a retrieval query, one may retrieve the set of document components that are most relevant to this query. In this chapter we shall first emphasise some aspects which, in our opinion, relate the explicit use of document structure to interactive retrieval performance, such as efficiency while browsing or querying information. In a second step we shall investigate two classes of implementation approaches to indexing and retrieving structured documents: passage retrieval and the explicit use of hierarchical document structures.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
</country>
<region>
<li>Auvergne-Rhône-Alpes</li>
<li>Rhône-Alpes</li>
</region>
</list>
<tree>
<country name="France">
<region name="Auvergne-Rhône-Alpes">
<name sortKey="Chiaramella, Yves" sort="Chiaramella, Yves" uniqKey="Chiaramella Y" first="Yves" last="Chiaramella">Yves Chiaramella</name>
</region>
<name sortKey="Chiaramella, Yves" sort="Chiaramella, Yves" uniqKey="Chiaramella Y" first="Yves" last="Chiaramella">Yves Chiaramella</name>
</country>
</tree>
</affiliations>
</record>
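
The record above can also be read outside Dilib. The following Python sketch, using only the standard library, extracts the title, year and DOI from a shortened copy of the record. One caveat: the record as displayed uses the `wicri:` prefix without declaring it, so a strict XML parser rejects it. The sketch works around this by binding the prefix to a placeholder URI; that URI, and the helper names `parse_record` and `summarize`, are assumptions made for illustration and not part of Wicri or Dilib.

```python
import xml.etree.ElementTree as ET

# Shortened fragment of the record above; the full record has the same shape.
RECORD = """<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Information Retrieval and Structured Documents</title>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<date when="2000" year="2000">2000</date>
<idno type="doi">10.1007/3-540-45368-7_12</idno>
</publicationStmt>
</fileDesc>
</teiHeader>
</TEI>
</record>"""

# Assumption: placeholder namespace URI, used only so the parser accepts
# the undeclared "wicri" prefix. It is not an official identifier.
WICRI_NS = "urn:example:wicri"


def parse_record(text):
    """Bind the 'wicri' prefix on the root element, then parse the record."""
    patched = text.replace("<record>", f'<record xmlns:wicri="{WICRI_NS}">', 1)
    return ET.fromstring(patched)


def summarize(root):
    """Collect a few bibliographic fields from the TEI header."""
    return {
        "title": root.findtext(".//titleStmt/title"),
        "year": root.findtext(".//publicationStmt/date"),
        "doi": root.findtext(".//publicationStmt/idno[@type='doi']"),
    }


summary = summarize(parse_record(RECORD))
print(summary)
```

Patching the root element keeps the record text otherwise unchanged; the `xml:` prefix used by `xml:lang` is predefined and needs no declaration.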

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Informatique/explor/SgmlV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001B25 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001B25 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Wicri/Informatique
   |area=    SgmlV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:B8BDC4B2184CBB1F85C4AFADFFEEE2C8601A399A
   |texte=   Information Retrieval and Structured Documents
}}

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jul 1 14:26:08 2019. Site generation: Wed Apr 28 21:40:44 2021